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Question 6 
Intent of Question 


The primary goals of this question were to assess a student’s ability to (1) use a scatterplot to comment on 
a report about the relationship between two variables and interpret the slope for the least-squares 
regression line summarizing this relationship; (2) describe the relationship between two variables in a 
scatterplot when a categorical variable is introduced and compare a characteristic of the distribution of a 
variable for different categories of individuals in a scatterplot; and (3) describe how the associations 
between two variables for each category of individuals in a scatterplot differ from the overall association in 
the same scatterplot. 


Solution 
Part (a): 


The scatterplot supports the newspaper report about number of semesters needed to complete an 
academic program and starting salary because it shows a positive association between these two 
variables. 


Part (b): 


The slope is 1.1594. For each additional semester needed to complete an academic program, the 
predicted starting salary increases by €1,159.40. 


Part (c): 


For the business majors alone, there is a strong, negative, linear association between number of 
semesters and starting salary. Business majors who need a greater number of semesters to complete 
an academic program tend to have lower starting salaries. 


Part (d): 


Business majors have the lowest median starting salary at around €38,000, followed by physics majors 
at around €48,000, and then chemistry majors with the highest median starting salary at around 
£95,000. 


Part (e): 


The newspaper report should be modified to account for major. Overall, majors that take longer to 
complete tend to have higher starting salaries, with chemistry the highest, physics the next highest, 
and business the lowest. However, within a major, students who take a greater number of semesters 
tend to have lower starting salaries. 


Scoring 


This question is scored in three sections. Section 1 consists of parts (a) and (b), section 2 consists of parts 
(c) and (d), and section 3 consists of part (e). Sections 1, 2, and 3 are scored as essentially correct (E), 
partially correct (P), or incorrect (I). 
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Question 6 (continued) 


Section 1 is scored as follows: 


Essentially correct (E) if the response includes the following five components: 


‘i 
Li 


2: 


In part (a) the response addresses the positive association. 

In part (a) the response uses the positive association to justify that the scatterplot supports the 
newspaper report. 

In part (b) the response correctly identifies the numerical value of the slope from the computer 
output. 

In part (b) the response interprets the slope as the change in starting salary for each additional 
semester, in context. 

In part (b) the interpretation of slope includes nondeterministic language (e.g., “predicted 
Starting salary” or equivalent) when interpreting the slope. 


Partially correct (P) if the response includes three or four of the five components. 


Incorrect (I) if the response does not meet the criteria for E or P. 


Notes: 


In part (a) the response can use phrases such as “positive association (correlation, 
relationship),” “increasing relationship,” or describe a positive association (e.g., “the starting 
salaries are higher when there is a greater number of semesters”) to satisfy component 1. 
However, describing the relationship between only two points does not satisfy this component. 
In part (a) comments about linearity and strength should be ignored, even if the response 
implies these are required (e.g., “Yes, because the relationship is strong, positive, and linear.”). 
In part (a) responses that answer “no,” “maybe,” “kind of,” “somewhat,” or equivalent do not 
satisfy component 2. 

In part (a) a response that says “no, because the three clusters of points each have a negative 
association” does not satisfy component 1 or component 2. 

In part (a) no context is required, but variable names are required in part (b) to satisfy 
component 4. 

In part (b) a response that states the equation y = 34.018 + 1.1594x does not satisfy 
component 3 unless the slope is specifically identified or used in the interpretation. 

In part (b) a response that incorrectly identifies the numerical value of the slope can still satisfy 
components 4 and 5 using the incorrect value. 

In part (b) the 1-unit increase in number of semesters must be stated or implied to satisfy 
component 4 (e.g., for each semester, for every semester). A response that states or implies an 
unspecified number of semesters does not satisfy this component (for example, as semesters 
increase). 

In part (b) examples of nondeterministic language include “predicted starting salary,” 
“expected starting salary,’ “estimated starting salary,’ “typical starting salary,” “average 
starting salary,’ “starting salary, on average,’ “our model predicts,” and so on. However, 
“about,” “approximately,” and “according to the model” do not satisfy component b. 

In part (b) no units are required for the change in predicted starting salary (which means it is 
OK to say 1.1594 euros, 1159.40, 1.1594, 1,159.40 dollars, and so on). 


In all parts it is acceptable if a response refers to salary rather than starting salary. 


Pn bk 
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Question 6 (continued) 
Section 2 is scored as follows: 


Essentially correct (E) if the response includes the following five components: 

In part (c) the response states that the association is negative. 

In part (c) the response states that the association is strong OR linear OR both. 

In part (c) the response refers to both variables (semesters, salary) in context. 

In part (d) the response correctly compares the three majors. 

In part (d) the response provides reasonable values for the median salaries or refers to “median 
starting salaries” when describing a characteristic of the graph. 


Ot ee 


Partially correct (P) if the response includes three or four of the five components. 
Incorrect (I) if the response does not meet the criteria for E or P. 


Notes: 

e In part (c) the response can use phrases such as “negative association (correlation, 
relationship),” “decreasing relationship,” “inversely related,” or describe a negative association 
(for example, “the starting salaries are smaller when there is a greater number of semesters”) to 
satisfy component 1. However, describing the relationship between only two points does not 
satisfy this component. 

e In part (c) responses that describe the relationship incorrectly as weak or nonlinear do not 
satisfy component 2, even if the other characteristic is described correctly. 

e Only drawing a line on the scatterplot does not satisfy component 2. 

e In part (d) aresponse that says “chemistry has the highest median starting salary and business 
has the lowest median starting salary” (or the equivalent) implies that physics is in the middle, 
and satisfies component 4. 

e In part (d) if no values are provided for the medians, the response must use the phrase “median 
starting salary” at least once to satisfy component 5. 

e Inall parts it is acceptable if a response refers to salary rather than starting salary. 


Section 3 is scored as follows: 


Essentially correct (E) if the response states that there is a negative association for each of the majors 
AND the response notes the overall positive association. 


Partially correct (P) if the response states that there is a negative association for each of the majors 
BUT does not note the overall positive association: 

OR 
if the response states that there is a negative association for one or two specific majors (for example, 
for business majors) AND the response notes the overall positive association. 


Incorrect (I) if the response does not meet the criteria for E or P. 


Notes: 
e Additional ways to note the overall positive association include stating that the original report 
was correct, stating that majors that take longer to complete tend to have higher starting 
salaries, or the equivalent. 
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Question 6 (continued) 


A response that does not explicitly name the majors or refer to the three majors but makes a 
general statement such as “However, for an academic major, there is a negative association” 
satisfies the requirement to state the negative association for each of the majors. 

A response that states the original report was wrong, or incorrect, or should be retracted, etc., 


cannot satisfy the requirement to note the overall positive association. However, a response 
that says the original report might be misleading (or the equivalent) but still notes the overall 
positive association satisfies the requirement to note the overall positive association. 
In all parts it is acceptable if a response refers to salary rather than starting salary. 
Complete Response 
All three sections essentially correct 


Substantial Response 


Two sections essentially correct and one section partially correct 


OR 

sections 1 and 2 partially correct and section 3 essentially correct 
OR 

sections 1 and 3 essentially correct and section 2 incorrect 
OR 


sections 2 and 3 essentially correct and section 1 incorrect 
Developing Response 


sections 1 and 2 essentially correct and section 3 incorrect 


OR 

section 1 essentially correct and sections 2 and 3 partially correct 
OR 

sections 1 and 3 partially correct and section 2 essentially correct 
OR 

One section essentially correct and one section partially correct 
OR 


All three sections partially correct 
Minimal Response 


One section essentially correct and two sections incorrect 
OR 

Two sections partially correct and one section incorrect 
OR 

One section partially correct and two sections incorrect 
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(a) Does the scatterplot support the newspaper report about number of semesters and starting salary’ Justify 
your answer. 
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The table below shows computer output from a linear regression analysis on the data, 


Predictor Coef SE Coef T 
Constant 34,018 4.455 7.64 


Semesters 1.1594 0.3482 3.33 


§ = 7.37702 R-Sq = 33.5% R-Sq(adj) = 30.5% 





(b) Identify the slope of the least-squares regression line, and interpret the slope im context. 


Y= \.1594x + 34.08. 
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any part of this page is —. GO ON TO THE NEXT PAGE. 
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An independent researcher received the data from the newspaper and conducted a new analysis by separating the 
data into three groups based on the mayor of each person. A revised scatterplot identifying the major of each © 


person is shown below, 
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(c) Based on the people in the sample, describe the association between starting salary and number of semesters 
for the business majors. 
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(d) Based on the people m the sample, compare the median starting salaries for the three majors. 
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(e) Based on the analysis conducted by the independent researcher, how could the newspaper report be modified @ 3 
to give a better description of the relationship between the number of semesters and the starting salary for 
the people in the sample? | 


Majors that typically take longer To complete 
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STOP 


END OF EXAM 


THE FOLLOWING INSTRUCTIONS APPLY TO THE COVERS OF THE 
SECTION If BOOKLET. 


e MAKE SURE YOU HAVE COMPLETED THE IDENTIFICATION 
INFORMATION. AS REQUESTED ON THE FRONT AND BACK 
COVERS OF THE SECTION If BOOKLET. 


e CHECK TO SEE THAT YOUR AP NUMBER LABEL APPEARS IN 
THE BOX ON THE COVER. 


e MAKE SURE YOU HAVE USED THE SAME SET OF AP 
NUMBER LABELS ON ALL AP EXAMS YOU HAVE TAKEN 
THIS YEAR. 


Unauthorized copying or reuse of 
any part of this page is illegal. 
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. (a) Does.the:scatterplot support the newspaper:report about number of semesters and starting salary? Justify 
your answer.” : a * 
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The table below shows computer output from a linear regression analysis on. the data. 
Predictor Coef SE Coet. T 


< 


Constant 34,018 4.455 7.64 


Semesters 1.1594 0.3482 3.33 





S = 7.37702 R-Sq = 33.5% -R-Sq(adj) = 30.5% 


(b) Identify the slope of the least-squares regression line, and interpret the slope in context. 
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An: independent researcher received the data from the newspaper and.conducted a new-analysis by separating the 
data into three groups based on the major of each person. A revised scatterplot identifying the major of each 
person is shown below. 
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(c) Based on the people in the sample, describe the association between starting salary and number of semesters 
for the business majors. 
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(d) Based on the people im the sample, compare the median starting salaries for the three majors. 
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(e) Based on the analysis conducted by the independent researcher, how could the newspaper report be modified 
to give a better description of the relationship between the number of semesters and the starting salary for 


the people in the sample? 
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END OF F EXAM 


THE FOLLOWING INSTRUCTIONS APPLY TO THE COVERS OF THE 
SECTION ll BOOKLET... 


e MAKE SURE you HAVE COMPLETED THE IDENTIFICATION 
INFORMATION AS REQUESTED ON' THE FRONT AND BACK 
COVERS OF THE SECTION | BOOKLET. 


© CHECK TO SEE THAT YOUR AP NUMBER LABEL APPEARS IN 
THE BOX ON THE COVER. 


e MAKE SURE YOU HAVE USED THE SAME SET OF AP 
NUMBER LABELS ON ALL AP EXAMS YOU HAVE TAKEN 
THIS YEAR. 


Unauthorized copying or reuse of 
.|-any part of this page is iljegal. 
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(a) Does the scatterplot support the newspaper report about number of semesters and starting salary? Justify 
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The table below shows computer output from a linear regression analysis on the data. 


Predictor Coef SE Coef , T 
Constant 34.018 4.455 7,64 


Semesters 1.1594 0.3482 3.03 





S = 7.37702 R-Sq = 33.5% R-Sq(adj) = 30.5% 


(b) Identify the slope of the least-squares regression line, and interpret the slope in context. 
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An independent researcher received the data from the newspaper and conducted a new analysis by separating the 
data into three groups based on the major of each person. A revised scatterplot identifying the major of each 


person is shown below. 
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(c) Based on the people 1 in the sample, describe the association between starting salary and number of semesters 
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(e) Based on the analysis conducted by the independent researcher, how could the newspaper report be modified 

to give a better description of the relationship between the number of semesters and the starting salary for | 
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END OF EXAM 





THE FOLLOWING INSTRUCTIONS APPLY TO THE COVERS OF THE 
SECTION Il BOOKLET. | , 


e MAKE SURE YOU HAVE COMPLETED THE IDENTIFICATION 
INFORMATION AS REQUESTED ON THE FRONT AND BACK 
COVERS OF THE SECTION II BOOKLET. 


® CHECK TO SEE THAT YOUR AP NUMBER LABEL APPEARS IN 
THE BOX ON THE COVER. . 


e MAKE SURE YOU HAVE USED THE SAME SET OF AP 
NUMBER LABELS ON ALL AP EXAMS YOU HAVE TAKEN 
THIS YEAR. 


Unauthorized copying or reuse of 
any part of this page Is illegal. 
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Question 6 
Overview 


The primary goals of this question were to assess a student's ability to (1) use a scatterplot to comment on 
a report about the relationship between two variables and interpret the slope for the least-squares 
regression line summarizing this relationship; (2) describe the relationship between two variables in a 
scatterplot when a categorical variable is introduced and compare a characteristic of the distribution of a 
variable for different categories of individuals in a scatterplot; and (3) describe how the associations 
between two variables for each category of individuals in a scatterplot differ from the overall association in 
the same scatterplot. 


Sample: 6A 
Score: 4 


In part (a) the response addresses the positive association (“positive correlation”) and uses the positive 
association to justify that the scatterplot supports the newspaper report (“Yes”), satisfying components 1 and 
2 of section 1. The comment about the correlation not being extremely strong is ignored because the 
intention of part (a) is to address the direction of the association. In part (b) the response identifies the 
numerical value of the slope in the interpretation, interprets the slope correctly as the change in starting 
salary for each additional semester, and includes nondeterministic language in the interpretation (“on 
average”), satisfying components 3, 4, and 5. Because the response satisfies all five components, section 1 
was scored as essentially correct. In part (c) the response states that the association is negative, states that 
the association is moderately strong, and uses both variable names (number of semesters, starting salary), 
satisfying components 1, 2, and 3 of section 2. In part (d) the response correctly compares the three majors 
and refers to “median starting salary,” satisfying components 4 and 5. Because the response satisfies all five 
components, section 2 was scored as essentially correct. In part (e) the response states that there is a 
negative association for each major (“taking longer than average ... in your major tends to lead to a lower 
starting salary”) and notes the overall positive association (“majors that typically take longer to complete tend 
to come with higher starting salaries”). Section 3 was scored as essentially correct. Because all three 
sections were scored as essentially correct, the response earned a score of 4. 


Sample: 6B 
Score: 3 


In part (a) the response identifies the positive association and uses the positive association to justify that the 
scatterplot supports the newspaper report (“Yes”), satisfying components 1 and 2 of section 1. In part (b) the 
response identifies the numerical value of the slope, interprets the slope correctly as the increase in starting 
salary as the number of semesters increases by 1, and includes nondeterministic language in the 
interpretation (“it is predicted that”), satisfying components 3, 4, and 5. Because the response satisfies all five 
components, section 1 was scored as essentially correct. In part (c) the response states that the association is 
negative and uses both variable names (number of semesters, starting salary), satisfying components 1 and 3 
of section 2. However, the response does not satisfy component 2 because there is no description of strength 
or form. In part (d) the response provides reasonable values for the median salaries, satisfying component 5. 
However, because the response does not actually compare the three majors, it does not satisfy component 4. 
Because the response satisfies only three of the five components, section 2 was scored as partially correct. In 
part (e) the response states that there is a negative association for each major (“in the second study, we see 
that the associations are actually negative”) and notes the overall positive association (“in the first newspaper 
report, the correlation seems to be positive”). Section 3 was scored as essentially correct. Because two 
sections were scored as essentially correct, and one section was scored partially correct, the response 
earned a score of 3. 
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Question 6 (continued) 


Sample: 6C 
Score: 2 


In part (a) the response identifies the positive association and uses the positive association to justify that the 
scatterplot supports the newspaper report (“Yes”), satisfying components 1 and 2 of section 1. In part (b) the 
response identifies the numerical value of the slope and interprets the slope correctly as the increase in 
starting salary for every one semester increase, satisfying components 3 and 4. However, because the 
response does not include nondeterministic language in the slope interpretation, component 5 is not 
satisfied. Because the response satisfies only four of the five components, section 1 was scored as partially 
correct. In part (c) the response states that the association is negative, states that the association is strong 
and linear, and uses both variable names (number of semesters, starting salary), satisfying components 1, 2, 
and 3 of section 2. In part (d) the response correctly compares the three majors (“business majors is the 
lowest ... physics majors have the next highest ... chemistry majors have the highest”) and provides 
reasonable values for the median salaries, satisfying components 4 and 5. Because the response satisfies all 
five components, section 2 was scored as essentially correct. In part (e) the response states that there is a 
negative association for each major (“show 3 seperate [sic] scatterplots ... show the negative/inverse 
relationship between number of semesters and starting salary”). However, because the response says that 
the article should be retracted, it does not satisfy the requirement to note the overall positive association. 
section 3 was scored as partially correct. Because section 2 was scored essentially correct, and sections 1 
and 3 were scored partially correct, the response earned a score of 2. 
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